Bioi INOLOGY ^ tssT--^- ' 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: ^ ^/ ^ 7 9^ ^ ^ ^ 

Source: 0//^<^ 

Date Processed by STIC: ^' / 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: patin21helD@usnto.gOv or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3help@uspto.gOv or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISIFNGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE TPIROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://ww>v.uspto.gov/\veb/offices/pac/checker 



Raw Sequence Listing Error Summary 



E RROR DETECTED SUGGESTED CORRECTION or^rvi^i. n uiyLi>t.iK: i / ^ ' ' j 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS* WHICH WERE INSERTED BY PTO SOFTWARE 



SERIAL NUMBER: 



1 Wrapped Nucleics 
Wrapped Aminos 



The numberAext at the end of each line ^Svrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent '^v^apping." 



2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 V^^^ saligned Amino 
Numbering 



10 



11 



12 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 



_Non-ASCn The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text 

_ Variable Length Sequence(s) ^contain n's or Xaa's representing more than one residue. Per Sequence Rules, 



_Patenthi 2.0 
"bug" 



7 Skipped Sequences 

(OLD RULES) 



each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ED NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:** response to include the skipped sequences. 



Skipped Sequences 
"(NEW RULES) 



Use of n*s or Xaa's 
"(NEW RULES) 



_In valid <213> 
Response 



Use of<220> 



Patentin 2.0 
"bug" 



Sequence(s) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown. Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <21 3> response is Unknown or 
is Artificial Sequence 



Sequence(s) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <21 3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See Tederal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 2,0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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OIPE 



RAW SEQUENCE LISTING DATE: 06/28/2001 

PATENT APPLICATION: US/09/879,257 TIME: 14:46:42 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\06282001\l879257.raw 

3 <110> APPLICANT: WAKO PURE CHEMICAL INDUSTRIES, LTD. 
5 <120> TITLE OF INVENTION: Hybrid Enzymes and Use Thereof 
7 <130> FILE REFERENCE: WJ018 
C--> 9 <140> CURRENT APPLICATION NUMBER: US/09/879,257 
C--> 10 <141> CURRENT FILING DATE: 2001-06-12 
12 <160> NUMBER OF SEQ ID NOS : 56 
14 <170> SOFTWARE: Patentln Ver. 2.1 



MotCompW 



^^^^Sette Needed 
corrected O^skeii 



ERRORED 


SEQUENCES 
























1 


<210> SEQ ID NO: 1 






















1 7 


<211> LENGTH: 206 






















1 R 
J. o 


<212> TYPE: 


PRT 






















1 Q 


<213> ORGANISM: Human 






















Z ± 


<40 0> SEQUENCE: 1 
























Gin Thr Asp 


Met Ser Arg 


Lys 


Ala Phe 


Val 


Phe 


Pro 


Lys 


Glu 


Ser 


t? — 

£j - 


— z 


"5 1 
Z J 


1 5 


10 




15 


















Z D 


Thr Ser Tyr 


Val Ser Leu 


Lys 


Ala Pro 


Leu 


Thr 


Lys 


Pro 


Leu 


Lys 


£> — 


— ^ 


z o 


20 


25 




30 


















28 


Phe Thr Val 


Cys Leu His 


Phe 


Tyr Thr 


Glu 


Leu 


Ser 


Ser 


Thr 


Arg 


E- 


-> 


29 


35 


40 


45 


















31 


Tyr Ser lie 


Phe Ser Tyr 


Ala 


Thr Lys 


Arg 


Gin 


Asp 


Asn 


Glu 


He 


E- 


-> 


32 


50 


55 


60 




















34 


lie Phe Trp 


Ser Lys Asp 


He 


Gly Tyr 


Ser 


Phe 


Thr 


Val 


Gly 


Gly 


E- 


-> 


35 


65 


70 75 


80 


















37 


Glu lie Leu 


Phe Glu Val 


Pro 


Glu Val 


Thr 


Val 


Ala 


Pro 


Val 


His 


E- 


-> 


38 


85 


90 




95 


















40 


Cys Thr Ser 


Trp Glu Ser 


Ala 


Ser Gly 


He 


Val 


Glu 


Phe 


Trp 


Val 


E- 


-> 


41 


100 


105 




110 


















43 


Gly Lys Pro 


Arg Val Arg 


Lys 


Ser Leu 


Lys 


Lys 


Gly 


Tyr 


Thr 


Val 


E- 


-> 


44 


115 


120 




125 


















46 


Ala Glu Ala 


Ser lie lie 


Leu 


Gly Gin 


Glu 


Gin 


Asp 


Ser 


Phe 


Gly 


E- 


-> 


47 


130 


135 


140 


















49 


Asn Phe Glu 


Gly Ser Gin 


Ser 


Leu Val 


Gly 


Asp 


He 


Gly 


Asn 


Val 


E- 


-> 


50 


145 


150 


155 




160 
















52 


Met Trp Asp 


Phe Val Leu 


Ser 


Pro Asp 


Glu 


He 


Asn 


Thr 


He 


Tyr 


E- 


-> 


53 


165 


170 




175 


















55 


Gly Gly Pro 


Phe Ser Pro 


Asn 


Val Leu 


Asn 


Trp 


Arg 


Ala 


Leu 


Lys 


E- 


-> 


56 


180 


185 




190 


















58 


Glu Val Gin 


Gly Glu Val 


Phe 


Thr Lys 


Pro 


Gin 


Leu 


Trp 


Pro 




E- 


-> 


59 


195 


200 




205 


















62 


<210> SEQ ID NO: 2 






















63 


<211> LENGTH: 16 






















64 


<212> TYPE: 


PRT 






















65 


<213> ORGANISM: Human 






















67 


<4 00> SEQUENCE: 2 



















no 



on 



r 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/879,257 



DATE: 06/28/2001 
TIME: 14:46:42 



Input Set : A:\PTO.txt 

Output Set: N:\CRF3\06282001\I879257.raw 



68 Asp Met Ser Arg Lys Ala Phe Val Phe Pro Lys Glu Ser Asp Thr Ser 



E--> 69 



15 



E--> 
E--> 



5 10 

72 <210> SEQ ID NO: 3 

73 <211> LENGTH: 27 

74 <212> TYPE: PRT 

75 <213> ORGANISM: Human 

77 <400> SEQUENCE: 3 

78 Leu Val Gly Asp lie Gly Asn Val Asn Met Trp Asp 

79 1 5 10 15 

81 Pro Asp Glu lie Asn Thr lie Tyr Leu Gly Gly 
20 25 
<210> SEQ ID NO: 4 

86 <211> LENGTH: 12 

87 <212> TYPE: PRT 

Human 
4 

Leu Lys Lys Gly Tyr Thr Val Gly Ala Glu Ala Ser 



Phe Val Leu Ser 



82 

85 



E--> 



88 <213> ORGANISM: 
90 <400> SEQUENCE: 
91 

92 1. 5 10 

95 <210> SEQ ID NO: 5 

96 <211> LENGTH: 10 

97 <212> TYPE: PRT 

98 <213> ORGANISM: Human 

100 <400> SEQUENCE: 5 

101 Arg Ala Leu Lys Tyr Glu Val Gin Gly Glu 



E--> 102 



10 



1 5 

105 <210> SEQ ID NO: 6 

106 <211> LENGTH: 486 

107 <212> TYPE: PRT 

108 <213> ORGANISM: Leuconostoc mesenteroides 

110 <400> SEQUENCE: 6 

111 Met Val Ser Glu He Lys Thr Leu Val Thr Phe Phe 
E--> 112 1 5 10 15 

114 Asp Leu Ala Lys Arg Lys Leu Tyr Pro Ser Val Phe 
E--> 115 20 25 30 

117 Lys Gly Tyr Leu Gin Lys His Phe Ala He Val Gly 
E--> 118 35 40 45 

120 Ala Leu Asn Asp Asp Glu Phe Lys Gin Leu Val Arg 
E--> 121 50 55 60 

123 Asp Phe Thr Asp Asp Gin Ala Gin Ala Glu Ala Phe 
E--> 124 65 70 75 80 

126 Ser Tyr Arg Ala His Asp Val Thr Asp Ala Ala Ser 
E--> 127 85 90 95 

<129 Lys Glu Ala He Glu Glu Ala Ala Asp Lys Phe Asp 
E--> 130 100 105 110 

132 Arg He Phe Tyr Met Ser Val Ala Pro Arg Phe Phe 
E--> 133 115 120 125 

135 Lys Tyr Leu Lys Ser Glu Gly Leu Leu Ala Asp Thr 
E--> 136 130 135 140 

138 Leu Met He Glu Lys Pro Phe Gly Thr Ser Tyr Asp 
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PATENT APPLICATION: US/09/879,257 
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Input Set : A:\PTO.txt 

Output Set: N:\CRF3\06282001\I879257.raw 



E- 


-> 


139 


145 


150 




155 






160 


















141 


Leu Gin Asn 


Asp 


Leu Glu 


Asn 


Ala 


Phe 


Asp 


Asp 


Asn 


Gin 


Leu 


Phe 


Arg 


E- 


-> 


142 


165 




170 






175 




















144 


lie Asp His 


Tyr 


Leu Gly 


Lys 


Glu 


Met 


Val 


Gin 


Asn 


He 


Ala 


Ala 


Leu 


E- 


-> 


145 


180 




185 




190 




















147 


Arg Phe Gly 


Asn 


Pro He 


Phe 


Asp 


Ala 


Ala 


Trp 


Asn 


Lys 


Asp 


Tyr 


He 


E- 


-> 


148 


195 




200 




205 






















150 


Lys Asn Val 


Gin 


Val Thr 


Leu 


Ser 


Glu 


Val 


Leu 


Gly 


Val 


Glu 


Glu 


Arg 


E- 


-> 


151 


210 


215 


220 






















153 


Ala Gly Tyr 


Tyr 


Asp Thr 


Ala 


Gly 


Ala 


Leu 


Leu 


Asp 


Met 


He 


Gin 


Asn 


E- 


-> 


154 


225 


230 




235 






240 


















156 


His Thr Met 


Gin 


He Val 


Gly 


Trp 


Leu 


Ala 


Met 


Glu 


Lys 


Pro 


Glu 


Ser 


E- 


-> 


157 


245 




250 






255 




















159 


Phe Thr Asp 


Lys 


Asp He 


Arg 


Ala 


Ala 


Lys 


Asn 


Ala 


Ala 


Phe 


Asn 


Ala 


E- 


-> 


160 


260 




265 




270 




















162 


Leu Lys lie 


Tyr 


Asp Glu 


Ala 


Glu 


Val 


Asn 


Lys 


Tyr 


Phe 


Val 


Arg 


Ala 


E- 


-> 


163 


275 




280 




285 






















165 


Gin Tyr Gly 


Ala 


Gly Asp 


Ser 


Ala 


Asp 


Phe 


Lys 


Pro 


Tyr 


Leu 


Glu 


Glu 


E- 


-> 


166 


290 


295 


300 






















168 


Leu Asp Val 


Pro 


Ala Asp 


Ser 


Lys 


Asn 


Asn 


Thr 


Phe 


He 


Ala 


Gly 


Glu 


E- 


-> 


169 


305 


310 




315 






320 


















171 


Leu Gin Phe 


Asp 


Leu Pro 


Arg 


Trp 


Glu 


Gly 


Val 


Pro 


Phe 


Tyr 


Val 


Arg 


E- 


-> 


172 


325 




330 






335 




















174 


Ser Gly Lys 


Arg 


Leu Ala 


Ala 


Lys 


Gin 


Thr 


Arg 


Val 


Asp 


He 


Val 


Phe 


E- 


-> 


175 


340 




345 




350 




















177 


Lys Ala Gly 


Thr 


Phe Asn 


Phe 


Gly 


Ser 


Glu 


Gin 


Glu 


Ala 


Gin 


Glu 


Ala 


E- 


-> 


178 


355 




360 




365 






















180 


Val Leu Ser 


lie 


He He 


Asp 


Pro 


Lys 


Gly 


Ala 


He 


Glu 


Leu 


Lys 


Leu 


E- 


-> 


181 


370 


375 


380 






















183 


Asn Ala Lys 


Ser 


Val Glu 


Asp 


Ala 


Phe 


Asn 


Thr 


Arg 


Thr 


He 


Asp 


Leu 


E- 


-> 


184 


385 


390 




395 






400 


















186 


Gly Trp Thr 


Val 


Ser Asp 


Glu 


Asp 


Lys 


Lys 


Asn 


Thr 


Pro 


Glu 


Pro 


Tyr 


E- 


-> 


187 


405 




410 






415 




















189 


Glu Arg Met 


He 


His Asp 


Thr 


Met 


Asn 


Gly 


Asp 


Gly 


Ser 


Asn 


Phe 


Ala 


E- 


-> 


190 


420 




425 




430 




















192 


Asp Trp Asn 


Gly Val Ser 


He 


Ala 


Trp 


Lys 


Phe 


Val 


Asp 


Ala 


He 


Ser 


E- 


-> 


193 


435 




440 




445 






















195 


Ala Val Tyr 


Thr 


Ala Asp 


Lys 


Ala 


Pro 


Leu 


Glu 


Thr 


Tyr 


Lys 


Ser Gly 


E- 


-> 


196 


450 


455 


460 






















198 


Ser Met Gly 


Pro 


Glu Ala 


Ser 


Asp 


Lys 


Leu 


Leu 


Ala 


Ala 


Asn 


Gly hsp 


E- 


-> 


199 

201 


465 

Ala Trp Val 


470 

Phe 


Lys Gly 


475 






480 














E- 


-> 


202 

490 


485 

<210> SEQ IE 


NO: 


30 


























491 


<211> LENGTH 


: 1024 


























492 


<212> TYPE: 


PRT 




























493 


<213> ORGANISM: 


Escherichia 


coli 




















495 


<400> SEQUENCE: 


30 


























496 


Met Thr Met 


He 


Thr Asp 


Ser 


Leu 


Ala 


Val 


Val 


Leu 


Gin 


Arg 


Arg 


Asp 




9 



7^ 
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RAW SEQUENCE LISTING DATE: 06/28/2001 

PATENT APPLICATION: US/09/879,257 TIME: 14:46:42 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\06282001\I879257.raw 

E--> 497 1 5 10 15 

499 Trp Glu Asn Pro Gly Val Thr Gin Leu Asn Arg Leu Ala Ala His Pro \ 
E--> 500 20 25 30 \ 

502 Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro \ 
E--> 503 35 40 45 \ 

505 Ser Gin Gin Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe ] 
E--> 506 50 55 60 

508 Pro Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro / 
E--> 509 65 70 75 80 / 

511 Glu Ala Asp Thr Val Val Val Pro Ser Asn Trp Gin Met His Gly Tyr / 
E--> 512 85 90 95 | 

514 Asp Ala Pro rt.e Tyf- Thr Asn Val Thr Tyr Pro lie Thr Val Asn Pro 1 
E--> 515 100 105 110 / 

517 Pro Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe / 
E--> 518 115 120 125 / 

520 Asn Val Asp Glu Ser Trp Leu Gin Glu Gly Gin Thr Arg lie lie Phe / 
E--> 521 130 135 140 

523 Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val 
E--> 524 145 150 155 160 

526 Gly Tyr Gly Gin Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala 
E--> 527 165 170 175 

529 Phe Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp 
E--> 530 180 185 190 

532 Ser Asp Gly Ser Tyr Leu Glu Asp Gin Asp Met Trp Arg Met Ser Gly 
E--> 533 195 200 205 

535 lie Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gin lie Ser 
E--> 536 210 215 220 

538 Asp Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val 
E--> 539 225 230 235 240 

541 Leu Glu Ala Glu Val Gin Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg 
E--> 542 245 1. 250 ' 255 . ~ . 

544 Val Thr Val Ser Leu Trp Gin Gly Glu Thr Gin Val Ala Ser. Gly Thr 
E--> 545 260 265 270 ^ 

547 Ala Pro Phe Gly Gly Glu lie lie Asp Glu Arg Gly Gly' Tyr Ala Asp 
E--> 548 275 280 285 

550 Arg Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu-. Trp Ser Ala 
E--> 551 290 295 300 

553 Glu lie Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp 
E--> 554 305 310 315 320 

556 Gly Thr Leu lie Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val 
E--> 557 325 330 335 

559 Arg lie Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu lie 
.E--> 560 340 345 350 

562 Arg Gly Val Asn Arg His Glu His His Pro Leu His Gly Gin Val Met 
E--> 563 355 360 365 

565 Asp Glu Gin Thr Met Val Gin Asp lie Leu Leu Met Lys Gin Asn Asn 
E--> 566 370 375 380 

568 Phe Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr 
E--> 569 385 390 395 400 
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571 


Thr Leu Cys 


Asp Arg Tyr 


Gly 


Leu 


Tyr 


Val 


Val 


Asp 


Glu 


Ala 


Asn 


He 


E- 


-> 


572 


405 


410 






415 




















574 


Glu Thr His 


Gly Met Val 


Pro 


Met 


Asn 


Arg 


Leu 


Thr 


Asp 


Asp 


Pro 


Arg 


E- 


-> 


575 


420 


425 




430 




















577 


Trp Leu Pro 


Ala Met Ser 


Glu 


Arg 


Val 


Thr 


Arg 


Met 


Val 


Gin 


Arg 


Asp 


E- 


-> 


578 


435 


440 




445 






















580 


Arg Asn His 


Pro Ser Val 


He 


He 


Trp 


Ser 


Leu 


Gly 


Asn 


Glu 


Ser 


Gly 


E- 


-> 


581 


450 


455 


460 






















583 


His Gly Ala 


Asn His Asp 


Ala 


Leu 


Tyr 


Arg 


Trp 


He 


Lys 


Ser 


Val 


Asp 


E- 


-> 


584 


465 


470 


475 






480 


















586 


Pro Ser Arg 


Pro Val Gin 


Tyr 


Glu 


Gly 


Gly 


Gly 


Ala 


Asp 


Thr 


Thr 


Ala 


E- 


-> 


587 


485 


490 






495 




















589 


Thr Asp lie 


lie Cys Pro 


Met 


Tyr 


Ala 


Arg 


Val 


Asp 


Glu 


Asp 


Gin 


Pro 


E- 


-> 


590 


500 


505 




510 




















592 


Phe Pro Ala 


Val Pro Lys 


Trp 


Ser 


He 


Lys 


Lys 


Trp 


Leu 


Ser 


Leu 


Pro 


E- 


-> 


593 


515 


520 




525 






















595 


Gly Glu Thr 


Arg Pro Leu 


He 


Leu 


Cys 


Glu 


Tyr 


Ala 


His 


Ala 


Met Gly 


E- 


-> 


596 


530 


535 


540 






















598 


Asn Ser Leu 


Gly Gly Phe 


Ala 


Lys 


Tyr 


Trp 


Gin 


Ala 


Phe 


Arg 


Gin 


Tyr 


E- 


-> 


599 


545 


550 


555 






560 


















601 


Pro Arg Leu 


Gin Gly Gly 


Phe 


Val 


Trp 


Asp 


Trp 


Val 


Asp 


Gin 


Ser 


Leu 


E- 


-> 


602 


565 


570 






575 




















604 


lie Lys Tyr 


Asp Glu Asn 


Gly 


Asn 


Pro 


Trp 


Ser 


Ala 


Tyr 


Gly 


Gly 


Asp 


E- 


-> 


605 


580 


585 




590 




















607 


Phe Gly Asp 


Thr Pro Asn 


Asp 


Arg 


Gin 


Phe 


Cys 


Met 


Asn 


Gly 


Leu 


Val 


E- 


-> 


608 


595 


600 




605 






















610 


Phe Ala Asp 


Arg Thr Pro 


His 


Pro 


Ala 


Leu 


Thr 


Glu 


Ala 


Lys 


His 


Gin 


E- 


-> 


611 


610 


615 


620 






















613 


Gin Gin Phe 


Phe Gin Phe 


Arg 


Leu 


Ser 


Gly 


Gin 


Thr 


He 


Glu 


Val 


Thr 


E- 


-> 


614 


625 


630 


635 






640 


















616 


Ser Glu Tyr 


Leu Phe Arg 


His 


Ser 


Asp 


Asn 


Glu 


Leu 


Leu 


His 


Trp 


Met 


E- 


-> 


617 


645 


650 






655 




















619 


Val Ala Leu 


Asp Gly Lys 


Pro 


Leu 


Ala 


Ser 


Gly 


Glu 


Val 


Pro 


Leu 


Asp 


E- 


-> 


620 


660 


665 




670 




















622 


Val Ala Pro 


Gin Gly Lys 


Gin 


Leu 


He 


Glu 


Leu 


Pro 


Glu 


Leu 


Pro 


Gin 


E- 


-> 


623 


675 


680 




685 






















625 


Pro gIu Ser 


Ala Gly Gin 


Leu 


Trp 


Leu 


Thr 


Val 


Arg 


Val 


Val 


Gin 


Pro 


E- 


-> 


626 


690 


695 


700 






















628 


Asn Ala Thr 


Ala Trp Ser 


Glu 


Ala 


Gly 


His 


He 


Ser 


Ala 


Trp 


Gin 


Gin 


E- 


-> 


629 


705 


710 


715 






720 


















631 


Trp Arg Leu 


Ala Glu Asn 


Leu 


Ser 


Val 


Thr 


Leu 


Pro 


Ala 


Ala 


Ser 


His 


E- 


-> 


632 


725 


730 






735 




















634 


Ala lie Pro 


His Leu Thr 


Thr 


Ser 


Glu 


Met 


Asp 


Phe 


Cys 


He 


Glu 


Leu 


E- 


-> 


635 


740 


745 




750 




















637 


Gly Asn Lys 


Arg Trp Gin 


Phe 


Asn 


Arg 


Gin 


Ser 


Gly 


Phe 


Leu 


Ser 


Gin 


E- 


-> 


638 


755 


760 




765 






















640 


Met Trp lie 


Gly Asp Lys 


Lys 


Gin 


Leu 


Leu 


Thr 


Pro 


Leu 


Arg 


Asp 


Gin 


E- 


-> 


641 


770 


775 


780 






















643 


Phe Thr Arg 


Ala Pro Leu 


Asp 


Asn 


Asp 


He 


Gly 


Val 


Ser 


Glu 


Ala 


Thr 
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E- 


-> 


644 


785 


790 


795 






800 
















646 


Arg lie Asp 


Pro Asn Ala 


Trp 


Val 


Glu 


Arg 


Trp 


Lys 


Ala 


Ala 


Gly 


E- 


-> 


647 


805 


810 






815 


















649 


Tyr Gin Ala 


Glu Ala Ala 


Leu 


Leu 


Gin 


Cys 


Thr 


Ala 


Asp 


Thr 


Leu 


E- 


-> 


650 


820 


82 5 




830 


















652 


Asp Ala Val 


Leu He Thr 


Thr 


Ala 


His 


Ala 


Trp 


Gin 


His 


Gin 


Gly 


E- 


-> 


653 


835 


840 




845 




















655 


Thr Leu Phe 


He Ser Arg 


Lys 


Thr 


Tyr 


Arg 


He 


Asp 


Gly 


Ser 


Gly 


E- 


-> 


656 


850 


855 


860 




















658 


Met Ala lie 


Thr Val Asp 


Val 


Glu 


Val 


Ala 


Ser 


Asp 


Thr 


Pro 


His 


E- 


-> 


659 


865 


870 


875 






880 
















661 


Ala Arg lie 


Gly Leu Asn 


Cys 


Gin 


Leu 


Ala 


Gin 


Val 


Ala 


Glu 


Arg 


E- 


-> 


662 


885 


890 






895 


















664 


Asn Trp Leu 


Gly Leu Gly 


Pro 


Gin 


Glu 


Asn 


Tyr 


Pro 


Asp 


Arg 


Leu 


E- 


-> 


665 


900 


905 




910 


















667 


Ala Ala Cys 


Phe Asp Arg 


Trp 


Asp 


Leu 


Pro 


Leu 


Ser 


Asp 


Met 


Tyr 


E- 


-> 


668 


915 


920 




925 




















670 


Pro Tyr Val 


Phe Pro Ser 


Glu 


Asn 


Gly 


Leu 


Arg 


Cys 


Gly 


Thr 


Arg 


E- 


-> 


671 


930 


935 


940 




















673 


Leu Asn Tyr 


Gly Pro His 


Gin 


Trp 


Arg 


Gly 


Asp 


Phe 


Gin 


Phe 


Asn 


E- 


-> 


674 


945 


950 


955 






960 
















676 


Ser Arg Tyr 


Ser Gin Gin 


Gin 


Leu 


Met 


Glu 


Thr 


Ser 


His 


Arg 


His 


E- 


-> 


677 


965 


970 






975 


















679 


Leu His Ala 


Glu Glu Gly 


Thr 


Trp 


Leu 


Asn 


He 


Asp 


Gly 


Phe 


His 


E- 


-> 


680 


980 


985 




990 


















682 


Gly He Gly 


Gly Asp Asp 


Ser 


Trp 


Ser 


Pro 


Ser 


Val 


Ser 


Ala 


Glu 


E- 


-> 


683 


995 


1000 


1005 




















685 


Gin Leu Ser 


Ala Gly Arg 


Tyr 


His 


Tyr 


Gin 


Leu 


Val 


Trp 


Cys 


Gin 


E- 


-> 


686 


1010 


1015 


1020 




















692 


<210> SEQ ID NO: 31 
























693 


<211> LENGTH: 448 
























694 


<212> TYPE: 


PRT 
























695 


<213> ORGANISM: Escherichia 


coli 


















697 


<400> SEQUENCE: 31 
























698 


Thr Pro Glu 


Met Pro Val 


Leu 


Glu 


Asn 


Arg 


Ala 


Ala 


Gin 


Gly 


Asp 


E- 


-> 


699 


1 5 


10 




15 


















701 


Thr Ala Pro 


Gly Gly Ala 


Arg 


Arg 


Leu 


Thr 


Gly 


Asp 


Gin 


Thr 


Ala 


E- 


-> 


702 


20 


25 




30 




















704 


Leu Arg Asp 


Ser Leu Ser 


Asp 


Lys 


Pro 


Ala 


Lys 


Asn 


He 


He 


Leu 


E- 


-> 


705 


35 


40 


45 




















707 


He Gly Asp Gly Met Gly Asp 


Ser 


Glu 


He 


Thr 


Ala 


Ala 


Arg 


Asn 


E- 


-> 


708 


50 


55 


60 






















710 


Ala Glu Gly Ala Gly Gly 


Phe 


Phe 


Lys 


Gly 


He 


Asp 


Ala 


Leu 


Pro 


E- 


-> 


711 


65 70 75 




80 


















713 


Thr Gly Gin 


Tyr Thr His 


Tyr 


Ala 


Leu 


Asn 


Lys 


Lys 


Thr 


Gly 


Lys 


E- 


-> 


714 


85 


90 




95 


















716 


Asp Tyr Val 


Thr Asp Ser 


Ala 


Ala 


Ser 


Ala 


Thr 


Ala 


Trp 


Ser 


Thr 


E- 


-> 


717 


100 


105 




110 


















719 


Val Lys Thr 


Tyr Asn Gly 


Ala 


Leu 


Gly 


Val 


Asp 


He 


His 


Glu 


Lys 



I 
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E--> 720 115 120 125 

722 His Pro Thr lie Leu Glu Met Ala Lys Ala Ala Gly Leu Ala Thr Gly 
E--> 723 130 135 140 

725 Asn Val Ser Thr Ala Glu Leu Gin Asp Ala Thr Pro Ala Ala Leu Val 
E--> 726 145 150 155 160 

728 Ala His Val Thr Ser Arg Lys Cys Tyr Gly Pro Ser Ala Thr Ser Glu 
E--> 729 165 170 175 

731 Lys Cys Pro Gly Asn Ala Leu Glu Lys Gly Gly Lys Gly Ser lie Thr 
E--> 732 180 185 190 

734 Glu Gin Leu Leu Asn Ala Arg Ala Asp Val Thr Leu Gly Gly Gly Ala 
E--> 735 195 200 205 

737 Lys Thr Phe Ala Glu Thr Ala Thr Ala Gly Glu Trp Gin Gly Lys Thr 
E--> 738 210 215 220 

740 Leu Arg Glu Gin Ala Gin Ala Arg Gly Tyr Gin Leu Val Ser Asp Ala 
E--> 741 225 ^ 230 235 240 

743 Ala Ser Leu Asn Ser Val Thr Glu Ala Asn Gin Gin Lys Pro Leu Leu 
E--> 744 245 250 255 

74 6 Gly Leu Phe Ala Asp Gly Asn Met Pro Val Arg Trp Leu Gly Pro Lys 
E--> 747 260 265 270 

749 Ala Thr Tyr His Gly Asn lie Asp Lys Pro Ala Val Thr Cys Thr Pro 
E--> 750 275 280 285 

752 Asn Pro Gin Arg Asn Asp Ser Val Pro Thr Leu Ala Gin Met Thr Asp 
E--> 753 290 295 300 

755 Lys Ala lie Glu Leu Leu Ser Lys Asn Glu Lys Gly Phe Phe Leu Gin 
E--> 756 305 310 315 320 

758 Val Glu Gly Ala Ser lie Asp Lys Gin Asp His Ala Ala Asn Pro Cys 
E--> 759 325 330 335 

761 Gly Gin He Gly Glu Thr Val Asp Leu Asp Glu Ala Val Gin Arg Ala 
E--> 762 340 345 350 

764 Leu Glu Phe Ala Lys Lys Glu Gly Asn Thr Leu Val He Val Thr Ala 
E--> 765 355 360 ^ - 365 

767 Asp. His Ala His Ala Ser Gin He Val Ala Pro Asp Thr Lys Ala Pro 
E--> 768 370 375 380 

770 Gly Leu Thr Gin Ala Leu Asn Thr Lys Asp Gly Ala Val Met Val Met 
E--> 771 385 390 395 400 

773 Ser Tyr Gly Asn Ser Glu Glu Asp Ser Gin Glu His Thr Gly Gin Leu 
E--> 774 405 410 415 

776 Arg He Ala Ala Tyr Gly Pro His Ala Ala Asn Val Val Gly Leu Thr 
E--> 777 420 425 430 

779 Asp Gin Thr Asp Leu Phe Tyr Thr Met Lys Ala Ala Leu Gly Leu Lys 
E--> 780 435 440 445 

945 <210> SEQ ID NO: 45 

946 <211> LENGTH: 55 

947 <212> TYPE: PRT 

948 <213> ORGANISM: Hepatitis B virus 

950 <400> SEQUENCE: 45 

951 Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro Arg 
E--> 952 1 5 10 15 

953 Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr Val 



5 
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E- 


-> 


954 






955 


E- 


-> 


956 






957 


E- 


-> 


958 






961 






962 






963 






964 






966 






967 , 


E- 


-> 


968 






996 






997 






998 






999 






1001 






1002 


E- 


-> 


1003 






1005 


E- 


-> 


1006 






1008 


E- 


-> 


1009 






1011 


E- 


-> 


1012 






1014 


E- 


-> 


1015 






1017 






1020 






1021 






1022 






1023 






1025 






1026 


E- 


-> 


1027 






1067 






1068 






1069 






1070 






1072 






1073 






1075 


E- 


-> 


1076 



20 



35 



25 30 

[ Pro Ala 
40 45 



Pro Asn lie Ala Ser His lie Ser Ser lie Ser Ala Arg 



46 



50 55 

<210> SEQ ID NO: 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Hepatitis B virus 
<400> SEQUENCE: 46 

Asp Pro Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly 

1 5 10 

<210> SEQ ID NO: 49 
<211> LENGTH: 84 
<212> TYPE: PRT 
<213> ORGANISM: Human 
<400> SEQUENCE: 49 

Ala Val Ser Glu lie Gin Phe Met His Asn Leu Gly Lys His Leu Ser 



(I 



10 



15 



Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gin Asp Val His 



20 



25 



30 



Asn Phe Val Ala Leu Gly Ala Ser lie Ala Tyr Arg Asp Gly Ser Ser 




35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



50 



<210> SEQ ID NO: 
<211> LENGTH: 15 
<212> TYPE: PRT 
<213> ORGANISM: Human 
<400> SEQUENCE: 50 

Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gin Asp Val His Asn 

1 5 10 15 

<210> SEQ ID 
<211> LENGTH :( 26 
TYPE: DN 

ORGANISM: Artificial Sequence 
FEATURE : 



<212> 
<213> 
<220> 
<223> 
<400> 




- 30 s/< 



OTHER INFORMATION: 
SEQUENCE: 54 



Description of Artificial Sequence: Oligonucleotide primer 



ctgcccggtt attattattt ttgacaccag 
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<210> 25 
<211> 32 ^ 



<212> DNA - jO / / ( . 

<213> Artificial Sequence /Pp/S^r^J. <^22^/^ fea /c^/^c^ ^ ^ /c:^ ^ ^ / -t^ 

<220> _ ^ ^r////c/c^/ ^QgaC^c^ 

<400> 25 

ttggatccag caggtacgtc taattcttca ag 32 



Page /O _ 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/879,257 



DATE: 06/28/2001 
TIME: 14:46:43 



Input Set : A:\PTO.txt 

Output Set: N:\CRF3\06282001\I879257.raw 

L:9 M:270 C: Current Application Number differs. Replaced Application Number 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:23 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:1 
M:332 Repeated in SeqNo=l 

L:69 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 2 
L:79 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:3 
M:332 Repeated in SeqNo=3 

L:92 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:4 
L:102 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 5 
L:112 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 6 
M:332 Repeated in SeqNo=6 

L:438 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION: 
L:497 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:30 
M:332 Repeated in SeqNo=30 

L:699 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:31 
M:332 Repeated in SeqNo=31 

L:952 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:45 
M:332 Repeated in SeqNo=45 

L:968 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:46 
L:1003 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:49 
M:332 Repeated in SeqNo=49 

L:1027 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:50 
L:1076 M:254 E: No, of Bases conflict, LENGTH : Input : 26 Counted:30 SEQ:54 
L:1076 M:252 E: No, of Seq . differs, <211>LENGTH : Input : 26 Found:30 SEQ:54 
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